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The invention relates to a method of calibrating a first microphone and a 
second microphone, comprising 

an acquisition step in which a first input audio signal is acquired by means of 
the first microphone and a second input audio signal is acquired by means of the second 

5 microphone; 

a caUbration step in which a first sensitivity of the first microphone and a 
second sensitivity of the second microphone is detennined. 

The invention also relates to an apparatus comprismg a first mi<»ophone and a 
second microphone for acqmring a first and a second input audio signal respectively, and a 
10 processor for determining a first sensitivity of the first microphone and a second sensitivity of 
the second microphone. 

The invention also relates to a computer program for execution by a processor, 
comprising program code for calibrating a first microphone and a second microphone, 
conqnising 

15 . an acquisition sl»p in which a first input audio signal is acquired by means of 

the first microphone and a second ioput audio signal is acquired by means of the second 
microphone; 

a calibration step in which a first sensitivity of the first microphone and a 
second sensitivity of the second mi<ax)phone is determined. 
20 The invention also relates to a data carrier storing a computer program for 

execution by a processor, comprising program code for calibrating a firstmiaraphone and a 
second microphone, which method comprises 

an acquisition step in which a first input audio signal is acquired by means of 
the first microphone and a second input audio signal is acquired by means of the second 
25 microphone; 

a calibration step in which a first sensitivity of the first microphone and a 
second sensitivity of the second microphone is detennined. 
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An apparatus for calibrating microphones is known from WOA-0201915. The 
known apparatus has a multitude of microphones and is useful for e.g. teleconferencing. The 
multitude of microphones enables better capture of the speech of a speaker, which leads to 
higher inteUigibility at the receiver side. Algorithms for exploiting the multitude of 
microphones require an accurate cahbration of the microphones. This can be done in the 
factory in an anechoic chamber, hut this is expensive. The known app^r^fic p^^^. , 
cahbration after purchase, which enables the connection and the cahbration of additional 
microphones on the fly. The disadvantage however is that the sensitivity of the microphones 
is detemiined as the relation between a predetermined acoustical iiq)ut signal appUed to the 
microphones and a measured electrical output signal fiom the microphones. 



15 



20 



25 
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It is a first object of the invention to provide a method of the kmd described in 
the opening paragraph, for caUbrating at least two microphones, which is versatile in its use. 

It is a second object of the invention to provide an apparatus of the kind 
described in the opening paragraph, which is versatile in its use. 

It is a third object of the invention to provide a computer program for 
execution on a processor comprising program code coding the method according to the 
invention. 

It is a fourth object of the mvention to provide a data earner storing a 
computer program according to the invention. 

The first object is realized m that in the calibration step an algorithm is ^hed 
which enables detemimation of the sensitivities, m the absence of a loudspeaker for the 
generation of the input audio signals. To detemiine the sensitivities in the known apparatus, a 
loudspeaker is required, which emits aprespecified sound, which serves as the acoustical ' 
iapnt for the microphones. In the method of the present mvention the calibration is performed 
by usmg an algorithm which allows the microphones to be calibrated with naturally present 
sound, such as speech from a person- e.g. the person executing th^ method- or a sound picked 
up on the street. This makes the method, and the apparatus applying the method, more 
employable to practical usage cases, since it avoids carrying around a loudspeaker. 

In an embodunent of the method, the first and the second input audio signal 
are processed by an adaptive beamforming filter, and the sensitivities are determined by 
performmg a calculation with weights of the adjq)tive beamfomung filter. Beamforming is a 
widely used algorithm for obtaining an mcreased sensitivity in the direction of a speaker. 
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and/or a reduced sensitivity in the direction of a source of noise, by making use of the input 
signals of a number of microphones. In the embodiment, use is made of the fact that the 
sensitivities of the microphones can be inferred from the coefficients of the filters used by the 
beamformer. 

In a more specific ^bodiment of the method, the algorithm comprises 

calculating 

^JL2|j7^(q^)|^ , in which is the discrete Fourier transform of the weights of the 

beamforming filter after adaptation, and the sum ranges over a predetermined number L of 
frequencies . Performing a calculation in the Fourier domain makes the detennination of 

the sensitivities more robust. 

The second object is realized in that the processor is able to determine the 
sensitivities, in the absence of a loudspeaker for generating the input audio signal. Often the 
microphones are integrated in an apparatus which is able to calibrate itself, such a 
teleconferencing apparatus. 

The third object is realized in that in the calibration step an algorithm is 
applied which enables detennination of the sensitivities, in the absence of a loudspeaker for 
generating the input audio signals. 

Tlie fourth object is realized in that in the calibration step an algorithm is 
applied which enables determination of the sensitivities, in the absence of a loudspeaker for 
generating the input audio signals. 



These and other aspects of the method, the apparatus, the computer program 
and the data carrier according to the invention will be apparent from and elucidated witii 
reference to the implementations and embodiments described hereinafter, and with reference 
to the accompanying drawings, which serve merely as a non limiting illustration. 

In the drawings : 

Fig. 1 schematically shows a teleconferencing session^ 

Fig. 2 schematically shows the method of calibrating a first and a second 

microphone; 

Fig. 3 schematically shows a beamforming apparatus; 

Fig. 4 schematically shows a naicrophone calibration apparatus of the prior art; 
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Fig. 5 schematically shows an ^aiatus for relative calibration of a first and a 
second microphone according to the invention; and 
Fig. 6 shows a data carrier. 

In these Figures elements drawn dashed are optional depending on the desired 

5 embodiment. 



In Fig. 1, a telecoxiferendng session is shown. A locally present person 107 is 
communicating with a remote person 109, who is e.g. shown on a display 1 1 1. For the 
10 communication of speech, an audio communication device is required, which is represented 
by the console 101. It can contain e.g. buttons, a small status display, a loudspeaker for the 
reproduction of speech uttered by the remote person 109, and a microphone. Practice shows 
lhat the locally present person 107 has to be very close to the microphone in the console 101, 
if he wants that the remote person 109 understands what he is saying. It is much more 
15 practical if the locally present person 107 can reside anywhere he likes. To achieve this, more 
than one microphone is used, illustrated with the first microphone 103 and the second 
microphone 105 in Fig. 1. Techniques have been developed to take advantage of the spatial 
arrangement of multiple microphones, in order to better capture the speech of a speaker. The 
beamforming technique is explained by means of Fig. 3. There are numerous appUcations 
20 which benefit fi-om beamforming and more particular firam tiie m^d for tiie relative 

caUbration of at least two microphones desiaibed m tihds text. One example is voice control. 
E.g. a television set could be equipped with a remote control based on keywords. 
Beamforming helps in decreasing the keyword recognition faikire rate. Also portable devices 
can be equipped with more than one microphone. 
25 In Fig. 2, a first input audio signal ul, coming &om a first microphone 205, 

and a second input audio signal u2, coming Scorn a second microphone 207, are acquired 
during an acquisition step ACQ. Botii input audio signals ul, u2 are used in the caHbration 
step CAI. to determine a first sensitivity al of the first microphone 205 and a second 
sensitivity a2 of the second microphone 207. 
2® Pig- 3 shows an apparatus 241 which is able to apply filtered-sum 

beamforming to the output of a number of microphones, e.g. tiu-ee, A first sound source is 
speech firom speaker 201. Suppose that the speech contains a single wavelengfli, and tiiat a 
speech wavefront 233 is planar and parallel to an unaginary Ime running through tiie first 
microphone 205 and tiie second microphone 207. Bach microphone tiien picks iq) tiie same 
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sound signal. The iSrst microphone 205 converts the sound into a sampled first electrical 
audio signal ul, and the same appUes to the second microphone 207 and fiirfher microphones 
if present. If the sensitivities of the microphones 205 and 207 are equal, the sampled 
electrical audio signals ul and u2 are equal. Suppose further that a second sound source 203 
produces e.g. music of a single wavelength, with planar wavefronts which impinge on the 
microphone array under an angle G. Then a music wavefiront 23 1 arrives at the first 
microphone 205 earlier than at the second microphone 207. This impUes that the electrical 
audio signals ul and u2 are samples at different phases of a smusoid of a certain spatial 
wavelength , which is related to the direction of incidence 9 and the wavelength % of the 
music of the second sound source 203. A highpass spatial filter can be designed which 
transmits the speech from speaker 201 with an infinite spatial wavelength ^ , but blocks the 
undesired interference fix)m the second sound source 203. A spatial filter consisting of a 
single multipUcation coefficient for each microphone is sufficient for fixed position, single 

wavelength sound sources. 

For broadband sound sources onitttng more wavelengths, a tonpoial filter is 
placed behind each microphone instead of a single multiplication coefficient. E.g. a first 
temporal filter 221 filters the electrical signal ul of the first microphone 205. Successive 
samples of ul are delayed by delay elements, like a first delay element 227, and the delayed 
samples are multipUed by filter coefficients, like a second filter coefficient 228, and added 
together by adders, like a first adder 229. The number of filter coefificients is^ep^identoa ...... 

how many samples from a sound signal axe desfrable and on how many computing resources 
are available. The outputs of the temporal filters 221 and 223 are summed by the spatial 
summation 230, to obtain a spatiotemporal filter output z. Spatiotemporal filter 240 can be 
described mathematically by means of equation [1]: 

m-l M 

In equation [13, describing a filtered sum beamformer, n is a discrete time 
index, 1 is an index of a filter coefficient w, T is a time difference between samples, and m is 
a microphone index corresponding to one of the microphones (205 and 207) and temporal 
filters (221 and 223). 

In order to properly filter out the audio signal of the interfering second sound 
source 203, the filter coefficients have to get the appropriate values during an adaptation 
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phase of beanrfoiming. If adulation appHes e.g. an algorithm which maximizes the power of 

z(n),imder1hecoiistraintthatforallfrequencies Ci, the foUowing condition is satisfied : 
l;K("*)f =C(Q,)^ m 

in which W„ (Q^ ) is the discrete Fourier transform of the filter w„ (n) and C a 

>nstmVthen^e^ptimal^terxoef5cientsTifta-gd^^ tion satisty equation l^ J: 

r;(Q,) = a(Q,)C(Q,) [3]. 

In equation [3], H'^ (Q^) is the complex conjugate of the discrete Fourier 
transform of the acoustic impulse response for Ihe microphone wifli index m. E.g. the first 
acoustical inq>ulse response hi in Fig. 2 is the acoustic inq)ulse response modeling flie sound 
transfer fiom the speaker 201 to the first microphone 205. a(Q,)isanall.passtetmwhichis 
common to all tenq)oral filters 221, 223 and 225. 

For a planar wavefiront like the music wavefront 23 1 , the acoustic transfer 
functions are propagation delays, and hence for each firequency k and microphone index m, 
equation [4] ^plies: 

15 |^r„(Q,)|^=l 

In a reverberant room fliis model is too simple. The sound traveling directly 
fi»m e.g. speaker 201 to e.g. the first microphone 205. can interfere constructively or 
destructively with e.g. a first reflection of the sound fi-om speaker 20 1 on a nearby wall. This 
can imply that e.g. at the position of the first microphone 205 there is hardly any sound 
power present at fi-equency . It is very unlikely that the interference should occur for all 
possible firequencies at the spatial position of a microphone, e.g. the first microphone 
205. Hence equation [5] is highly likely to be vaKd: 

»1 [51. 
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20 



25 



Using equation [5], which guarantees that the sound from speaker 201 is 
transferred approximately equally to the microphones 205 and 207, it can be proven that the 
relative sensitivities a„ of each microphone follow from equation [6]: 



[«!• 
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Hence it is possible to introduce correction factors - 21 1 and 213 in Fig. 2 - 
which multiply with the electrical audio signals ul and u2, in order to make the microphones 
205 and 207 equally sensitive. These correction factors can be calculated as in equation [7]: 

nt 

in which c is a constant. 

The spatiotemporal filter 240 implementing filtered sum beamformer [3] 
identifies the acoustic transfer fimctions between a sound source measured during calibration 
and the microphones upto an unknown error factor which is common to all microphones. The 
feet that the error factor is common to all microphones allows calibration of the microphones 
relative to each other. 

If the speaker is a certain distance away firom the microphones^ e.g. sittmg in a 
chair watching a television with multiple microphones for voice command, the method works 
well because the acoustic impulse responses hi and h2 are similar to propagation delays^ 
which means that all microphones receive essentially the same sound as input. If there is a 
strong reverberation from e.g. a nearby wall to a microphone at certain frequencies, the 
method may work less well. Pathological frequency regions can be discarded by modifying 
the algorithm, by using equation [8] in place of equation [6] : 



[8] 

The sum in equation [8] is taken over a number i of frequency intervals 
[^M^m] » ^ which e.g. no spuriously large values of (Q^) occur. If the sum covers 
enough frequencies Q^^ , <i„ is stiU a reliable measure of the relative s^itivity of the m-th 
microphone. iV, is the total number of frequencies in all the intervals [Ar^,fc,+i ] together. To 
increase accuracy, it is advantageous to also drop the lowest and highest frequencies from the 
siEamation, since some of the microphones can have a spurious behavior in these frequency 
regions. 

Fig. 4 shows a microphone calibration apparatus of the prior art. An electrical 
loudspeaker audio signal e is sent from a signal source 304 to loudspeaker 301, in which it is 
converted to soimd 302, which is picked up by microphone 303. Microphone 303 converts 
the sound to an electrical microphone audio signal s. In the known apparatus it is required 
that both tiie loudspeaker audio signal e and the microphone audio signal s, are sent to a 
processor 305, which is able to determine microphone sensitivity 307 from the two audio 
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signals. M tihe current invaition no loudspeaker audio signal e is required by the calibration 
algorithm. The iiq)ut of a sound source like speaker 201 is sufficient 

Fig. 5 shows an apparatus 401 for relative calibration of a first and a second 
microphone 403 and 405 according to the invention. A processor 407 has access to a first 
5 audio signal &om a first microphone 403 and a second audio signal fi-om a second 
microphone 405. It is possible to run an algorithm according to the invention, as e.g. 
illustrated with Fig. 3, on the processor 407, which e.g. calibrates the microphones 403 and 
405 after a certain amount of time to counteract time varying effects like e.g. con^onent 
aging or temperature related effects. Another option is that a user pushes e.g. a button 409 
10 and initiates the calibration, e.g. every time he takes the apparatus into a dif^ent room 
which has different acoustic impulse responses. 

An interesting option is to calibrate only wh^ the sound coming into the 
microphones is speech, by adding a speech detector. 

Fig. 6 shows a data carrier for storage of a computer program for execution on 
15 a processor describing a method according to the invention for calibrating a first and a second 
nuorophone. 

It shoiild be noted that the above-mentioned embodiments illustrate rather than 
limit the invention and that those skilled in the art are able to design altematives, without 
departing from the scope of the claims. Apart from combinations of elements of the mvention 

20 as combmed in the claims, other combinations of the elements within the scope of the 
invention as perceived by one skilled in the art are covered by the invention. Any 
combination of elements can be realized in a single dedicated element Any reference sign 
between parentheses in the claim is not intended for limiting the claim. The word 
"conqmsing" does not exclude the presence of elements or aspects not listed in a claim. The 

25 word "a" or "an" precedmg an element does not exclude flie presence of a plurality of such 
elements. 

The invention can be implemented by means of hardware or by means of 
software running on a computer. 




15 
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CLAIMS: EPO - DG 1 

iaoa2oo2 
@ 

I A method of calibrating a first miarophone and a second microphone, 

comprising 

an acquisition step in which a first mput audio signal is acquired by means of 
the first microphone and a second input audio signal is acquired by means of the second 
S microphone; 

a calibration step in which a first sensitivity of the first microphone and a 
second sensitivity of the second microphone is determined, 
characterized in that in the calibration step an algorithm is appUed which enables 
determination of Hie sensitivities, in the absence of a loudspeaker for generating the input 
10 audio signals. 

2. A method as claimed in claim 1, characterized hi that the first and the second 

ii^ut audio signal are processed by an ad^tive beamfonning filter, and the sensitivities are 
deteraiined by perfonning a calculation with weights of the ad^tive beamfotming filter. 



3 A method of calibrating a first and a.second.mi©cophone as claimed in.cl^ 2, 

characterized in that the algorithm comprises calculating 



I 



— (^*)f » ^ which JF" is the discrete Fourier transform of the weights of the 



beamforming filter after adaptation, and the sum ranges over a predetermined number L of 
20 frequencies £1^ . 

4. An apparatus comprising a first naicrophone and a second microphone for 

acquiring a first and a second input audio signal respectively, and a processor for detennining 
a first sensitivity of the first microphone and a second sensitivity of the second microphone, 
25 characterized in that the processor is able to determine the sensitivities, in the absence of a 
loudspeaker for generating the input audio signals. 



PHNL020831 



10 12.09.2002 
5. A conq)uter program for execution by a processor, describing a method of 

caKbrating a first microphone and a second microphone, which method comprises 

an acquisition step in which a first input audio signal is acquired by means of 
the first microphone and a second input audio signal is acquired by means of the second 
5 microphone; 

" a cahbration step in which a first sensitivity of the first microphoTie atiH a 

second sensitivity of the second microphone is determined, 
characterized in that in the caUbration step an algorithm is ^Hed which enables 
determination of the sensitivities, m the absaice of a loudspeaker for generatmg the input 
10 audio signals. 



15 



6. A data carrier storing a computer program for execution by a processor, 

describing a method of calibrating a first microphone and a second microphone, which 
method comprises 

an acquisition step in which a first input audio signal is acquired by means of 
flie first microphone and a second input audio signal is acquired by means of the second 
microphone; 

a calibration step in which a first sensitivity of the first microphone and a 
second sensitivity of the second microphone is determined, 
20 characterized in that in the calibration step an algorithm is applied which enables 

determination of the sensiti^dti^; fii the alis^ce of a loudspeaker for generating the ii^ut 
audio signals. 
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ABSTRACT: 



The mefliod for the relative calibration of at least two microphones (205, 207), 
is able to obtain the relative sensitivities (al, a2) of the microphones, without the need of a 
controlled sound input from a loudspeaker (301). For the determination of the sensitivities 
(al, a2) of the microphones, the calibration algorithm (CAL) uses tiie coefficients (wl, w2, 
5 w3) of a beamforming filter after ad^tation. 

Fig. 3. 
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